ON A NONHIERARCHICAL VERSION OF THE GENERALIZED
RANDOM ENERGY MODEL

By Erwin Bolthausen and Nicola Kistler 

Universität Zürich

We introduce a natural nonhierarchical version of Derrida's gen- 
eralized random energy model. We prove that, in the thermodynami- 
cal limit, the free energy is the same as that of a suitably constructed 
GREM. 

1. Introduction and definition of the model. The generalized random 
energy model (GREM for short), introduced by Derrida [1], plays an impor- 
tant role in spin glass theory. Originally invented as a simple model which 
exhibits replica symmetry breaking at various levels, it has become clear that 
more interesting models, like the celebrated one of Sherrington-Kirkpatrick, 
exhibit GREM-like behavior in the large N limit. Despite the spectacular 
recent progress in understanding the SK-model (see [4, 5, 7]), many issues 
have not been clarified at all, the most prominent one being the so-called 
ultrametricity. [A metric $d$ is called an ultrametric if the strengthened
triangle condition holds: $d(x,z) \le \max(d(x,y), d(y,z))$. Equivalently, two balls
are either disjoint or one is contained in the other.] The GREM is of limited
use to investigate this because it is hierarchically organized from the start. 
This favorable situation allows for a complete solution, fully confirming the 
so-called Parisi theory (we refer the reader to the detailed study [2] where 
it is also pointed out that, interestingly, the emerging ultrametricity of the 
Gibbs measure does not necessarily coincide with the starting hierarchical 
organization). Yet, from the considerations on the GREM, one gets little clue 
on why many systems should be ultrametric in the limit. [In Talagrand's
recent proof of the Parisi formula, ultrametricity plays no apparent role, and
it seems to be quite delicate to prove ultrametricity by Talagrand's method.
This is quite curious as, on the other hand, ultrametricity plays a crucial
role in the physicists' nonrigorous derivation of the free energy, be that using
the replica trick or the cavity method.]

We present here a simple and, as we think, natural generalization of the
GREM which has no built-in ultrametric structure. We show, however, that
in the limit the model is ultrametrically organized. In this paper we address
only the free energy. The more delicate question of the ultrametricity
of the Gibbs distribution will be investigated in a forthcoming paper.

Throughout this paper, we fix a number $n \in \mathbb{N}$, and consider the set
$I = \{1, \ldots, n\}$ as well as a collection of nonnegative real numbers $\{a_J\}_{J \subset I}$ such
that

$\sum_{J \subset I} a_J = 1.$

For convenience, we put $a_\emptyset := 0$. The relevant subsets of $I$ will be only the
ones with positive $a$-value. For $A \subset I$, we set

$\mathcal{P}_A := \{J \subset A : a_J > 0\}, \quad \mathcal{P} := \mathcal{P}_I.$

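For $N \in \mathbb{N}$, we set $\Sigma_N := \{1, \ldots, 2^N\}$. We also fix positive real numbers $\gamma_i$,
$i \in I$, satisfying

$\sum_{i=1}^{n} \gamma_i = 1,$

and write $\Sigma_N^i := \Sigma_{\gamma_i N}$ where, for notational convenience, we assume that
$2^{\gamma_i N}$ is an integer. For $N \in \mathbb{N}$, we will label the "spin configurations" $\sigma$ as

$\sigma = (\sigma_1, \ldots, \sigma_n), \quad \sigma_i \in \Sigma_N^i,$

that is, we identify $\Sigma_N$ with $\Sigma_N^1 \times \cdots \times \Sigma_N^n$. For $J \subset I$, $J = \{j_1, \ldots, j_k\}$,
$j_1 < j_2 < \cdots < j_k$, we write $\Sigma_{N,J} := \prod_{s=1}^{k} \Sigma_N^{j_s}$, and for $\sigma \in \Sigma_N$, we write $\sigma_J$ for
the projected configuration $(\sigma_j)_{j \in J} \in \Sigma_{N,J}$. Our spin glass Hamiltonian is
defined as

(1)  $X_\sigma := \sum_{J \in \mathcal{P}} X^J_{\sigma_J},$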

where $X^J_{\sigma_J}$, $J \in \mathcal{P}$, $\sigma_J \in \Sigma_{N,J}$, are independent centered Gaussian random
variables with variance $a_J N$. The $X_\sigma$ are then Gaussian random variables
with variance $N$ (Gaussian always means "centered Gaussian" throughout this
note), but they are correlated. $\mathbb{E}$ will denote expectation with respect to
these random variables. A special case is when $\mathcal{P} = \{I\}$, that is, when only
$a_I \neq 0$, in which case it has to be one. Then the $X_\sigma$ are independent, that
is, one considers simply a set of $2^N$ independent Gaussian random variables
with variance $N$. This is the standard random energy model.

The generalized random energy model is a special case, too: It corresponds
to the situation where the sets in $\mathcal{P}$ are "nested," meaning that $\mathcal{P}$ consists of
an increasing sequence of subsets. Without loss of generality, we may assume
that in this case

(2)  $\mathcal{P} = \{J_m : 1 \le m \le k\}, \quad J_m := \{1, \ldots, n_m\},$

where $1 \le n_1 < n_2 < \cdots < n_k \le n$. In the GREM case, the natural metric $d$ on
$\Sigma_N$ coming from the covariance structure is an ultrametric. In the more
general case (1) considered here, this metric is not an ultrametric.

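To see this, take $n = 3$, $\mathcal{P} = \{\{1,2\}, \{1,3\}, \{2,3\}\}$, that is,

(3)  $X_\sigma = X^{\{1,2\}}_{\sigma_1,\sigma_2} + X^{\{1,3\}}_{\sigma_1,\sigma_3} + X^{\{2,3\}}_{\sigma_2,\sigma_3},$

with $a_J = 1/3$ for $J \in \mathcal{P}$. Then for $a, b, b', c, c' \in \Sigma_{N/3}$, $b \neq b'$, $c \neq c'$, one has

$d((a,b,c), (a,b,c')) = d((a,b,c'), (a,b',c')) = \sqrt{2N/3},$

whereas

$d((a,b,c), (a,b',c')) = \sqrt{N},$

contradicting ultrametricity.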
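
The computation above can be checked by brute force. The following is a minimal sketch, assuming the metric is normalized as $d(\sigma,\sigma')^2 = N - \mathrm{Cov}(X_\sigma, X_{\sigma'})$ (a normalization chosen here because it reproduces the values just quoted) and taking $N = 1$ as the overall scale. The function names, the two-letter alphabet and the nested weights `a_grem` are illustrative assumptions, not part of the model definition.

```python
from itertools import product
from math import isclose, sqrt

def distance(sigma, tau, a, N=1.0):
    """Assumed normalization: d^2 = N - Cov(X_sigma, X_tau)."""
    # Cov(X_sigma, X_tau) = N * (sum of a_J over the J on which sigma and tau agree).
    agree = sum(w for J, w in a.items() if all(sigma[j] == tau[j] for j in J))
    return sqrt(N * max(0.0, sum(a.values()) - agree))

def is_ultrametric(a, n, alphabet=(0, 1), tol=1e-12):
    """Brute-force check of d(x,z) <= max(d(x,y), d(y,z)) on a tiny configuration space."""
    configs = list(product(alphabet, repeat=n))
    return all(
        distance(x, z, a) <= max(distance(x, y, a), distance(y, z, a)) + tol
        for x in configs for y in configs for z in configs
    )

# Example (3); coordinates are 0-based here, so {1,2} becomes (0, 1), and so on.
a_example = {(0, 1): 1 / 3, (0, 2): 1 / 3, (1, 2): 1 / 3}
# A nested (GREM-type) family for comparison; these weights are illustrative only.
a_grem = {(0,): 0.2, (0, 1): 0.3, (0, 1, 2): 0.5}

x, y, z = ("a", "b", "c"), ("a", "b", "c'"), ("a", "b'", "c'")
d_xy = distance(x, y, a_example)
d_yz = distance(y, z, a_example)
d_xz = distance(x, z, a_example)
```

Under these assumptions `d_xz` exceeds both `d_xy` and `d_yz`, while the nested family passes the brute-force ultrametric check.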

Any of our models can be "coarse-grained" in many ways into a GREM.
For that, consider strictly increasing sequences of subsets of $I$: $\emptyset = A_0 \subset
A_1 \subset \cdots \subset A_K = I$. We do not assume that the $A_i$ are in $\mathcal{P}$. We call such a
sequence a chain $T = (A_0, A_1, \ldots, A_K)$. We attach weights $\hat{a}_{A_j}$ to these sets
by putting

(4)  $\hat{a}_{A_j} := \sum_{B \in \mathcal{P}_{A_j} \setminus \mathcal{P}_{A_{j-1}}} a_B.$

Evidently $\sum_{j=1}^{K} \hat{a}_{A_j} = 1$, and if we assign random variables $X_\sigma(T)$, according
to (1), we arrive after an irrelevant renumbering of $I$ at a GREM of the
form (2). In particular, the corresponding metric $d$ is an ultrametric.
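
As an illustration of the coarse-graining, the weights attached to a chain can be computed directly. This is a small sketch under the reading that the weight of $A_j$ collects the $a_B$ of all $B \subset A_j$ that are not already contained in $A_{j-1}$; the function name `chain_weights` and the numerical values are hypothetical.

```python
from math import isclose

def chain_weights(a, chain):
    """Weight of A_j: sum of a_B over the B contained in A_j but not in A_{j-1}."""
    weights = []
    for prev, cur in zip(chain, chain[1:]):
        weights.append(sum(v for B, v in a.items() if B <= cur and not B <= prev))
    return weights

# Illustrative variances on the three pairs of I = {1, 2, 3} (they sum to 1).
a = {frozenset({1, 2}): 0.5, frozenset({1, 3}): 0.3, frozenset({2, 3}): 0.2}

# Two chains from the empty set to I; the sets A_j need not belong to P.
chain_pair = [frozenset(), frozenset({1, 2}), frozenset({1, 2, 3})]
chain_single = [frozenset(), frozenset({1}), frozenset({1, 2, 3})]

weights_pair = chain_weights(a, chain_pair)
weights_single = chain_weights(a, chain_single)
```

In the second chain the set $\{1\}$ contains no element of $\mathcal{P}$, so its weight vanishes and the whole variance sits on the last level.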

We write $\mathrm{tr}(\cdot)$ for averaging over $\Sigma_N$ (i.e., the coin-tossing expectation if
we identify $\Sigma_N$ with $\{H,T\}^N$).

For a function $x : \Sigma_N \to \mathbb{R}$, set

$Z_N(\beta, x) := \mathrm{tr} \exp[\beta x], \quad F_N(\beta, x) := \frac{1}{N} \log Z_N(\beta, x),$

and define the usual finite $N$ partition function and free energy by

$Z_N(\beta) := Z_N(\beta, X), \quad F_N(\beta) := F_N(\beta, X), \quad f_N(\beta) := \mathbb{E}(F_N(\beta, X)),$

where $X$ is interpreted as a random function $\Sigma_N \to \mathbb{R}$.

For any chain $T$, we attach to our model a GREM $(X_\sigma(T))_{\sigma \in \Sigma_N}$ as
explained above, and then

$f_N(T, \beta) := \mathbb{E}(F_N(\beta, X(T))),$

$f(T, \beta) := \lim_{N \to \infty} f_N(T, \beta).$

For a GREM, the limiting free energy is known to exist, and can be expressed
explicitly, but in a somewhat complicated way (see [1, 3]). Our main result
is that our generalization of the GREM does not lead to anything new in
the $N \to \infty$ limit, hopefully shedding some modest light on the "universality"
of ultrametricity.



Theorem 1.

(5)  $f(\beta) := \lim_{N \to \infty} f_N(\beta)$

exists and is also the almost sure limit of $F_N(\beta)$.

$f(\beta)$ is the free energy of a GREM. More precisely, there exists a chain
$T$ such that

(6)  $f(\beta) = f(T, \beta), \quad \beta \ge 0.$

$f(T, \beta)$ is minimal in the sense that

(7)  $f(\beta) = \min_{S} f(S, \beta),$

the minimum being taken over all chains $S$.



The fact that the free energy is self-averaging, meaning that $f(\beta)$ (if the limit
exists) is also the almost sure limit of the $F_N$, is a simple consequence of
the Gaussian concentration inequality. We write $F_N$ as a function of the
standardized variables $X^J_{\sigma_J} / \sqrt{a_J N}$. As

$\left| \log \sum_i e^{a_i} - \log \sum_i e^{a_i'} \right| \le \max_i |a_i - a_i'|, \quad a_i, a_i' \in \mathbb{R},$

we get that $F_N(\beta)$, regarded as a function of the collection $(X^J_{\sigma_J} / \sqrt{a_J N})$, is
Lipschitz continuous with Lipschitz constant $\beta / \sqrt{N}$. By the usual concentration
of measure estimates for Gaussian distributions (see, e.g., Proposition
2.18 of [6]), we have

(8)  $\mathbb{P}\left[ |F_N(\beta) - \mathbb{E} F_N(\beta)| > \varepsilon \right] \le 2 \exp\left[ -\frac{\varepsilon^2 N}{2\beta^2} \right].$






Using the Borel-Cantelli lemma, one sees that if $\lim_{N \to \infty} f_N(\beta)$ exists, then
the $F_N(\beta)$ converge almost surely to this limit, too, and if $\lim_{N \to \infty} F_N(\beta)$
exists almost surely, then the limit is nonrandom and equals $\lim_{N \to \infty} f_N(\beta)$.
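
The Borel-Cantelli step only needs the bounds in (8) to be summable in $N$, which holds because they form a geometric series. The sketch below evaluates the bound, read here as $2\exp[-\varepsilon^2 N / (2\beta^2)]$ (the form that follows from the Lipschitz constant $\beta/\sqrt{N}$), and compares a partial sum with the closed-form sum; the function names and the values of `eps` and `beta` are illustrative assumptions.

```python
from math import exp

def concentration_bound(eps, beta, N):
    """Right-hand side of (8), read as 2 exp(-eps^2 N / (2 beta^2))."""
    return 2.0 * exp(-eps**2 * N / (2.0 * beta**2))

def total_bound(eps, beta):
    """Sum over N >= 1 of the bounds: a geometric series with ratio r < 1."""
    r = exp(-eps**2 / (2.0 * beta**2))
    return 2.0 * r / (1.0 - r)

eps, beta = 0.1, 1.0
partial = sum(concentration_bound(eps, beta, N) for N in range(1, 5001))
total = total_bound(eps, beta)
```

Since the total is finite for every $\varepsilon > 0$, the events $\{|F_N(\beta) - \mathbb{E}F_N(\beta)| > \varepsilon\}$ occur only finitely often, almost surely.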

As for the strategy of the proof, the existence of the limit is established 
through a quite standard application of the second moment method, akin to 
that originally exploited by Derrida in his seminal paper [1]; this allows us to
express the limiting free energy in terms of a variational problem, which we
then solve inductively. 

For the reader's convenience, we briefly describe the mechanism which lies
behind Theorem 1 for the Hamiltonian (3), but we allow for general (positive)
variances $a_{12}, a_{13}, a_{23}$, and general $\gamma_i$. It is best to count the number
of configurations $\sigma$ which reach a certain energy level $\lambda N$. It is evident that
only an exponentially small portion of the total number $2^N$ of configurations
achieve this, roughly formulated (we will be more precise later),

$\#\{\sigma : X_\sigma \sim \lambda N\} \sim 2^N e^{-\rho(\lambda) N}, \quad \rho(\lambda) \ge 0.$

The free energy is obtained by the Legendre transform of $\rho$. In order to
determine $\rho(\lambda)$, we count individually for each of the three parts in (3) how
many configurations reach respective levels

(9)  $\rho(\lambda_1, \lambda_2, \lambda_3) \sim -\frac{1}{N} \log \#\{\sigma : X^{\{1,2\}}_{\sigma_1,\sigma_2} \sim \lambda_1 N,\ X^{\{1,3\}}_{\sigma_1,\sigma_3} \sim \lambda_2 N,\ X^{\{2,3\}}_{\sigma_2,\sigma_3} \sim \lambda_3 N\} + \log 2,$

with $\lambda_1 + \lambda_2 + \lambda_3 = \lambda$. Evidently,

(10)  $\rho(\lambda) = \inf_{\lambda_1 + \lambda_2 + \lambda_3 = \lambda} \rho(\lambda_1, \lambda_2, \lambda_3).$

It turns out that one can get $\rho$ by computing expectations inside the
logarithm, provided only some naturally defined restrictions on the $\lambda_i$ are
satisfied. For small $\lambda$, it is easily seen that one has an "equipartition" property,
and that the optimal $\lambda_1, \lambda_2, \lambda_3$ are proportional to the respective variances,
that is, $\lambda_1 = a_{12}\lambda$, $\lambda_2 = a_{13}\lambda$, $\lambda_3 = a_{23}\lambda$, and from that one obtains

(11)  $\rho(\lambda) = \lambda^2/2,$

which is the same as if the $X_\sigma$ were uncorrelated. Increasing $\lambda$, however,
we encounter restrictions from the structure of the Hamiltonian. First
of all, $\lambda_1$ has to be such that there are any $\sigma_1, \sigma_2$ with $X^{\{1,2\}}_{\sigma_1,\sigma_2} \sim \lambda_1 N =
a_{12} \lambda N$. There are $2^{(\gamma_1+\gamma_2)N}$ pairs $(\sigma_1, \sigma_2)$, and as the $X^{\{1,2\}}_{\sigma_1,\sigma_2}$ are independent,
the restriction is

$2^{(\gamma_1+\gamma_2)N} \exp\left[ -\frac{a_{12}\lambda^2}{2} N \right] \ge 1.$

(We are not considering any log-corrections.) This leads to the restriction

(12)  $\lambda \le \sqrt{\frac{2(\gamma_1+\gamma_2)\log 2}{a_{12}}}$

for the validity of (11), and there are two similar restrictions coming from
$X^{\{1,3\}}$ and $X^{\{2,3\}}$. Even if these three restrictions are satisfied, it can be
that there are simply not enough triples $(\sigma_1, \sigma_2, \sigma_3)$ left. A necessary
condition for having enough of them is certainly that the expected value of
$\#\{\sigma : X_\sigma \sim \lambda N\}$ is not exponentially decaying, which is simply the
condition that $\lambda \le \sqrt{2\log 2}$.
The somewhat astonishing fact is that these are the only conditions one has
to take into consideration for the validity of (11). Now, there are two cases:



Case 1. $\lambda \le \sqrt{2\log 2}$ implies the other ones, that is,

(13)  $\min_{1 \le i < j \le 3} \frac{\gamma_i + \gamma_j}{a_{ij}} \ge 1.$

In that case, we are simply left with the restriction $\lambda \le \sqrt{2\log 2}$, and the
free energy is

$f(\beta) = \sup_{\lambda \le \sqrt{2\log 2}} (\beta\lambda - \lambda^2/2),$

which is the free energy of an REM. In that case the internal structure of
the model is irrelevant, at least for the free energy.

Case 2. (13) is violated. For definiteness, assume that $(\gamma_1 + \gamma_2)/a_{12}$ is
the smallest one.



In that case, (11) is only correct in the region (12). For $\lambda$ larger, there is
no $(\sigma_1, \sigma_2)$ with $X^{\{1,2\}}_{\sigma_1,\sigma_2} \sim a_{12}\lambda N$ (with probability close to 1), the maximum
of the $X^{\{1,2\}}_{\sigma_1,\sigma_2}$ being at $m_{12} N$ ($\pm$ log-corrections), where

$m_{12} := \sqrt{2(\gamma_1 + \gamma_2)\, a_{12} \log 2}.$

Therefore, one has to restrict in (10) to $\lambda$'s with $\lambda_1 = m_{12}$. The only
configurations $\sigma$ for which $X_\sigma \sim \lambda N$ have to satisfy

(14)  $X^{\{1,2\}}_{\sigma_1,\sigma_2} \sim m_{12} N,$

but there are now only subexponentially many $(\sigma_1, \sigma_2)$ left which achieve
this feat, and the difference to $\lambda N$ has to be made by the field

$Y_{\sigma_1,\sigma_2}(\sigma_3) := X^{\{1,3\}}_{\sigma_1,\sigma_3} + X^{\{2,3\}}_{\sigma_2,\sigma_3}, \quad 1 \le \sigma_3 \le 2^{\gamma_3 N},$

restricting $(\sigma_1, \sigma_2)$ to the few which satisfy (14). There is an upper limit
$\lambda_{\max}$ for $\lambda$'s such that there are any $\sigma_3$ with $Y(\sigma_3) \sim (\lambda - m_{12}) N$. $\lambda_{\max} N -
m_{12} N$ is simply the maximum of $2^{\gamma_3 N}$ independent Gaussians with variance
$(a_{13} + a_{23}) N$, that is,

$\lambda_{\max} - m_{12} = \sqrt{2\gamma_3 (a_{13} + a_{23}) \log 2}.$

The situation is similar to the one in the GREM with the only difference that,
for $(\sigma_1, \sigma_2) \neq (\sigma_1', \sigma_2')$, the fields $Y_{\sigma_1,\sigma_2}$ and $Y_{\sigma_1',\sigma_2'}$ are not independent, except
when $\sigma_1 \neq \sigma_1'$ and $\sigma_2 \neq \sigma_2'$. It is, however, fairly evident that, among the
$(\sigma_1, \sigma_2)$ for which $X^{\{1,2\}}_{\sigma_1,\sigma_2} \sim m_{12} N$, there will be no pairs with such a partial
overlap, with probability close to 1, and therefore, it is quite natural that one can
handle the field $Y_{\sigma_1,\sigma_2}$ as if it came from the second level of a two-level
GREM. In fact, it turns out that, in Case 2, the tree of Theorem 1
is $\{\{1,2\}, \{1,2,3\}\}$, and we replace our model with the coarse-grained one
with Hamiltonian $X'_{\alpha_1} + X''_{\alpha_1,\alpha_2}$, where $\#\alpha_1 = 2^{(\gamma_1+\gamma_2)N}$, $\mathrm{var}(X') = a_{12} N$,
$\#\alpha_2 = 2^{\gamma_3 N}$, $\mathrm{var}(X'') = (a_{13} + a_{23}) N$.
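
The case distinction and the resulting energy levels can be tabulated numerically. The sketch below only covers what the heuristic above states: it tests condition (13) and, in Case 2 with the pair $\{1,2\}$ realizing the minimum, evaluates the threshold (12), $m_{12}$ and $\lambda_{\max}$. The function name `heuristic_levels` and the parameter values are hypothetical.

```python
from math import isclose, log, sqrt

def heuristic_levels(a12, a13, a23, g1, g2, g3):
    """Energy levels of the three-pair heuristic; all quantities are per unit N."""
    ratios = {(1, 2): (g1 + g2) / a12, (1, 3): (g1 + g3) / a13, (2, 3): (g2 + g3) / a23}
    if min(ratios.values()) >= 1.0:
        # Case 1: condition (13) holds and only lambda <= sqrt(2 log 2) remains.
        return {"case": 1, "lambda_max": sqrt(2 * log(2))}
    if min(ratios, key=ratios.get) != (1, 2):
        raise ValueError("relabel the coordinates so that the pair {1,2} has the smallest ratio")
    lambda_12 = sqrt(2 * (g1 + g2) * log(2) / a12)   # right-hand side of (12)
    m12 = sqrt(2 * (g1 + g2) * a12 * log(2))         # maximum of the {1,2}-field, per unit N
    lambda_max = m12 + sqrt(2 * g3 * (a13 + a23) * log(2))
    return {"case": 2, "lambda_12": lambda_12, "m12": m12, "lambda_max": lambda_max}

# Symmetric example (3): every ratio equals 2, so Case 1 applies.
case1 = heuristic_levels(1 / 3, 1 / 3, 1 / 3, 1 / 3, 1 / 3, 1 / 3)
# Illustrative parameters with a heavy {1,2}-field: the ratio is 0.5, so Case 2 applies.
case2 = heuristic_levels(0.8, 0.1, 0.1, 0.2, 0.2, 0.6)
```

In the second example the equipartition level $a_{12}\lambda$ reaches $m_{12}$ exactly at the threshold (12), and $\lambda_{\max}$ stays below the REM value $\sqrt{2\log 2}$.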

This way of reasoning works for the general case. There are two issues
which might be somewhat surprising. The first is that expressions like (9) can
always be evaluated by computing expectations inside the logarithm,
provided one keeps some fairly trivial restrictions on the $\lambda_J$. Second, it is not
entirely evident why these restrictions finally always lead to tree structures.

It is also interesting that the system always chooses from the many 
GREMs which can be obtained by coarse-grainings the one with minimal 
free energy. A similar behavior has already been obtained for the GREM 
itself in [2]. 

2. Second moment estimates. We fix some notation: If $(a_N)_{N \in \mathbb{N}}$ and
$(b_N)_{N \in \mathbb{N}}$ are two sequences of positive real numbers, we write $a_N \asymp b_N$ if,
for all $\varepsilon > 0$, there exists $N_0(\varepsilon) \in \mathbb{N}$ such that

$e^{-\varepsilon N} b_N \le a_N \le e^{\varepsilon N} b_N$

for $N \ge N_0$. We also write $a_N \ll b_N$ if, for some $\delta > 0$, one has $a_N \le b_N e^{-\delta N}$,
again for large enough $N$. In that case, we also write $a_N = \Omega(b_N)$. The same
notation is used in the case of sequences of random variables, just meaning
that the relations hold almost surely (and therefore $N_0$ may depend on $\omega$).
For $A \subset I$ (not necessarily in $\mathcal{P}$), we set

$\gamma(A) := \sum_{i \in A} \gamma_i.$

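We rewrite $F_N$ in terms of energy levels. For a collection $\lambda = (\lambda_J)_{J \in \mathcal{P}}$, $\lambda_J \in
\mathbb{R}$, and $A \subset I$, we set

$\mathcal{N}_{N,A}(\lambda) := \#\{\sigma \in \Sigma_{N,A} : X^J_{\sigma_J} \ge \lambda_J N,\ \forall J \in \mathcal{P}_A\},$
$\mathcal{N}_N(\lambda) := \mathcal{N}_{N,I}(\lambda).$

Clearly,

(15)  $\{\mathcal{N}_{N,A}(\lambda) = 0\} \subset \{\mathcal{N}_N(\lambda) = 0\}.$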

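We express $F_N$ in terms of the $\mathcal{N}_N(\lambda)$:

(16)  $F_N(\beta) = \frac{1}{N} \log\left[ 2^{-N} (\beta N)^{|\mathcal{P}|} \int_{\mathbb{R}^{\mathcal{P}}} d\lambda\, \mathcal{N}_N(\lambda) \prod_{J \in \mathcal{P}} e^{\beta \lambda_J N} \right]$
      $= \frac{1}{N} \log \int_{\mathbb{R}^{\mathcal{P}}} d\lambda\, \mathcal{N}_N(\lambda) \prod_{J \in \mathcal{P}} e^{\beta \lambda_J N} - \log 2 + O\!\left( \frac{\log N}{N} \right).$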


We first want to take out the $\lambda$ for which $\mathcal{N}_N(\lambda) = 0$ for large $N$. As these
are integer-valued random variables, it is clear that $\mathbb{E}\mathcal{N}_N(\lambda) \ll 1$ implies
$\mathcal{N}_N(\lambda) = 0$ for large enough $N$, almost surely. It turns out, however, that
this condition is not sufficient for our purpose; but note that if, for some
$A \subset I$, one has $\mathbb{E}\mathcal{N}_{N,A}(\lambda) \ll 1$, then, by (15), one has $\mathcal{N}_N(\lambda) = 0$ for large
enough $N$ as well.

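Lemma 2. (a) For any $\lambda \in \mathbb{R}^{\mathcal{P}}$ and $A \subset I$, we have

$\mathbb{E}\mathcal{N}_{N,A}(\lambda) \asymp 2^{\gamma(A) N} \exp\left[ -\sum_{J \in \mathcal{P}_A} \frac{(\lambda_J^+)^2}{2a_J} N \right],$

where $\lambda_J^+ := \max(\lambda_J, 0)$.

(b) There exists $C > 0$ such that

$\mathbb{E}\mathcal{N}_N(\lambda) \le C\, 2^N \exp\left[ -\sum_{J \in \mathcal{P}} \frac{(\lambda_J^+)^2}{2a_J} N \right]$

for all $\lambda \in \mathbb{R}^{\mathcal{P}}$, and all $N$.

(c) Let $\lambda \in \mathbb{R}^{\mathcal{P}}$. If for some $A \subset I$ one has

$\sum_{J \in \mathcal{P}_A} \frac{(\lambda_J^+)^2}{2a_J} > \gamma(A) \log 2,$

then $\mathbb{P}(\mathcal{N}_N(\lambda) \neq 0) \ll 1$ and, in particular, $\mathcal{N}_N(\lambda) = 0$ for large enough $N$,
almost surely.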

Proof. (a) and (b) follow by standard Gaussian tail estimates, and in
case (c), by (15), we have

$\mathbb{P}(\mathcal{N}_N(\lambda) \neq 0) \le \mathbb{P}(\mathcal{N}_{N,A}(\lambda) \neq 0) \le \mathbb{E}\mathcal{N}_{N,A}(\lambda),$

which proves (c). □
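
The criterion of Lemma 2(c) is easy to test by enumerating all nonempty $A \subset I$. The sketch below lists the subsets at which the inequality of (c) holds; the function names and the parameter values are illustrative assumptions. The example is chosen so that the condition for $A = I$ alone would not detect the problem, which is the reason the sub-collections $\mathcal{P}_A$ are needed.

```python
from itertools import combinations
from math import log

def nonempty_subsets(I):
    items = sorted(I)
    for r in range(1, len(items) + 1):
        for c in combinations(items, r):
            yield frozenset(c)

def violated_subsets(lam, a, gamma):
    """All A with sum_{J in P_A} (lam_J^+)^2 / (2 a_J) > gamma(A) log 2."""
    bad = []
    for A in nonempty_subsets(gamma):
        lhs = sum(max(lam[J], 0.0) ** 2 / (2 * a[J]) for J in a if J <= A)
        if lhs > sum(gamma[i] for i in A) * log(2):
            bad.append(A)
    return bad

# Illustrative parameters: a heavy field on {1,2}, most of the volume on coordinate 3.
a = {frozenset({1, 2}): 0.8, frozenset({1, 3}): 0.1, frozenset({2, 3}): 0.1}
gamma = {1: 0.2, 2: 0.2, 3: 0.6}

# Levels proportional to the variances, lam_J = a_J * beta.
lam_high = {J: w * 1.0 for J, w in a.items()}
lam_low = {J: w * 0.5 for J, w in a.items()}

bad_high = violated_subsets(lam_high, a, gamma)
bad_low = violated_subsets(lam_low, a, gamma)
```

With these values only $A = \{1,2\}$ is flagged for `lam_high`, although the condition for the full set $I$ is satisfied.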
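We set

$\Lambda := \left\{ \lambda \in \mathbb{R}^{\mathcal{P}} : \sum_{J \in \mathcal{P}_A} \frac{(\lambda_J^+)^2}{2a_J} \le \gamma(A) \log 2,\ \forall A \subset I \right\},$
$\Lambda^+ := \{\lambda \in \Lambda : \lambda_J \ge 0,\ \forall J \in \mathcal{P}\}.$

Lemma 3. If $\lambda \in \mathrm{int}\,\Lambda$, then

(17)  $\mathcal{N}_N(\lambda) \asymp \mathbb{E}\mathcal{N}_N(\lambda) \asymp \exp\left[ N \left( \log 2 - \sum_{J \in \mathcal{P}} \frac{(\lambda_J^+)^2}{2a_J} \right) \right].$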

Proof. The second relation is Lemma 2(a). For the proof of the first,
it suffices to show that $\lambda \in \mathrm{int}\,\Lambda$ implies

(18)  $\mathrm{var}\, \mathcal{N}_N(\lambda) \ll (\mathbb{E}\mathcal{N}_N(\lambda))^2.$

In fact, (18) together with Chebyshev's inequality and the Borel-Cantelli lemma
immediately implies (17).

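We abbreviate $\mathbb{P}(X^J_{\sigma_J} \ge N\lambda_J)$ by $p_J(N)$ ($\lambda$ is kept fixed throughout this
proof). With this notation, $\mathbb{E}\mathcal{N}_N(\lambda) = 2^N \prod_{J \in \mathcal{P}} p_J(N)$, and

$\mathbb{E}\mathcal{N}_N(\lambda)^2 = \sum_{\sigma, \sigma'} \prod_{J \in \mathcal{P}} \mathbb{P}(X^J_{\sigma_J} \ge N\lambda_J,\ X^J_{\sigma'_J} \ge N\lambda_J)$
$= \sum_{A \subset I} \sum_{(\sigma, \sigma') \in \Lambda_A(N)} \prod_{J \in \mathcal{P}} \mathbb{P}(X^J_{\sigma_J} \ge N\lambda_J,\ X^J_{\sigma'_J} \ge N\lambda_J)$
$= \sum_{A \subset I} |\Lambda_A(N)| \prod_{J \in \mathcal{P}_A} p_J(N) \prod_{J \in \mathcal{P} \setminus \mathcal{P}_A} p_J(N)^2,$

where $\Lambda_A(N)$ consists of those pairs $(\sigma, \sigma')$ which agree on $A$ and disagree
on $I \setminus A$. For $A = \emptyset$, $2^{2N} - |\Lambda_\emptyset(N)| \ll 2^{2N}$, and therefore,

$\mathrm{var}\, \mathcal{N}_N(\lambda) = \sum_{A \neq \emptyset} |\Lambda_A(N)| \prod_{J \in \mathcal{P}_A} p_J(N) \prod_{J \in \mathcal{P} \setminus \mathcal{P}_A} p_J(N)^2 + \Omega((\mathbb{E}\mathcal{N}_N(\lambda))^2).$

Moreover,

$|\Lambda_A(N)| = 2^{\gamma(A) N} \prod_{i \notin A} 2^{\gamma_i N} (2^{\gamma_i N} - 1) = 2^{2N} 2^{-\gamma(A) N} + \Omega(|\Lambda_A(N)|).$

As by assumption,

$2^{-\gamma(A) N} \ll \prod_{J \in \mathcal{P}_A} p_J(N) \asymp \exp\left[ -N \sum_{J \in \mathcal{P}_A} \frac{(\lambda_J^+)^2}{2a_J} \right],$

we have, for any $A \neq \emptyset$,

$|\Lambda_A(N)| \prod_{J \in \mathcal{P}_A} p_J(N) \prod_{J \in \mathcal{P} \setminus \mathcal{P}_A} p_J(N)^2 \ll 2^{2N} \prod_{J \in \mathcal{P}} p_J(N)^2,$

proving $\mathrm{var}\, \mathcal{N}_N(\lambda) \ll (\mathbb{E}\mathcal{N}_N(\lambda))^2$. □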



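Let

(19)  $\psi(\lambda, \beta) := \sum_{J \in \mathcal{P}} \left( \beta\lambda_J - \frac{\lambda_J^2}{2a_J} \right).$

Proposition 4. The free energy as defined in (5) exists and is given as

(20)  $f(\beta) = \sup_{\lambda \in \Lambda^+} \psi(\lambda, \beta).$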

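Proof. We show the lower bound for $\liminf_{N \to \infty} F_N(\beta)$ and the upper
bound for $\limsup_{N \to \infty} f_N(\beta)$. By the self-averaging property (8), this proves
the statement.

We use the integral representation (16). If $\mu = (\mu_J)$, $\nu = (\nu_J)$ satisfy $\mu_J <
\nu_J$ for all $J$, we write

$[\mu, \nu) := \{\lambda : \mu_J \le \lambda_J < \nu_J,\ \forall J \in \mathcal{P}\}.$

If $[\mu, \nu) \subset \Lambda^+$, we have

$\liminf_{N \to \infty} F_N(\beta) \ge \liminf_{N \to \infty} \frac{1}{N} \log \int_{[\mu,\nu)} d\lambda\, \mathcal{N}_N(\lambda) \prod_{J \in \mathcal{P}} e^{\beta\lambda_J N} - \log 2$
$\ge \liminf_{N \to \infty} \frac{1}{N} \log\left[ \mathcal{N}_N(\nu) \int_{[\mu,\nu)} d\lambda \prod_{J \in \mathcal{P}} e^{\beta\lambda_J N} \right] - \log 2.$

By Lemma 3 (applied when $\nu \in \mathrm{int}\,\Lambda$), the right-hand side is at least
$\sum_{J \in \mathcal{P}} (\beta\mu_J - \nu_J^2 / 2a_J)$. As this holds for arbitrary $[\mu, \nu) \subset \Lambda^+$,
$\liminf_{N \to \infty} F_N(\beta) \ge \sup_{\lambda \in \Lambda^+} \psi(\lambda, \beta)$ follows.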

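For the upper bound, let $\Lambda_\varepsilon$ be an $\varepsilon$-neighborhood of $\Lambda$. From Lemma
2(c), we have

$\mathbb{P}\left( F_N(\beta) \neq \frac{1}{N} \log\left[ 2^{-N} (\beta N)^{|\mathcal{P}|} \int_{\Lambda_\varepsilon} d\lambda\, \mathcal{N}_N(\lambda)\, e^{\beta N \sum_J \lambda_J} \right] \right) \ll 1,$

and therefore,

$\left| \mathbb{E} F_N(\beta) - \frac{1}{N} \mathbb{E} \log\left[ 2^{-N} (\beta N)^{|\mathcal{P}|} \int_{\Lambda_\varepsilon} d\lambda\, \mathcal{N}_N(\lambda)\, e^{\beta N \sum_J \lambda_J} \right] \right| \ll 1.$

By Jensen's inequality and Lemma 2(b), we have

$\limsup_{N \to \infty} \frac{1}{N} \mathbb{E} \log\left[ 2^{-N} (\beta N)^{|\mathcal{P}|} \int_{\Lambda_\varepsilon} d\lambda\, \mathcal{N}_N(\lambda)\, e^{\beta N \sum_J \lambda_J} \right]$
$\le \limsup_{N \to \infty} \frac{1}{N} \log\left[ 2^{-N} (\beta N)^{|\mathcal{P}|} \int_{\Lambda_\varepsilon} d\lambda\, \mathbb{E}\mathcal{N}_N(\lambda)\, e^{\beta N \sum_J \lambda_J} \right]$
$\le \limsup_{N \to \infty} \frac{1}{N} \log \int_{\Lambda_\varepsilon} d\lambda\, \exp\left[ N \sum_{J \in \mathcal{P}} \left( \beta\lambda_J - \frac{(\lambda_J^+)^2}{2a_J} \right) \right]$
$\le \sup_{\lambda \in \Lambda_\varepsilon} \psi(\lambda, \beta).$

As $\varepsilon > 0$ is arbitrary, $\limsup_{N \to \infty} \mathbb{E} F_N(\beta) \le \sup_{\lambda \in \Lambda^+} \psi(\lambda, \beta)$ follows. □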
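
For the REM, that is $\mathcal{P} = \{I\}$ with $a_I = 1$, the variational formula (20) reduces to a one-dimensional problem: maximize $\beta\lambda - \lambda^2/2$ over $0 \le \lambda \le \sqrt{2\log 2}$. The sketch below compares the closed-form maximizer with a crude grid search; the function names and the grid size are illustrative choices.

```python
from math import isclose, log, sqrt

LAMBDA_C = sqrt(2 * log(2))  # largest admissible level in the REM case

def rem_free_energy(beta):
    """sup of beta*lam - lam^2/2 over 0 <= lam <= sqrt(2 log 2), in closed form."""
    lam = min(beta, LAMBDA_C)  # unconstrained maximizer beta, clipped to the constraint
    return beta * lam - lam**2 / 2

def rem_free_energy_grid(beta, steps=20001):
    """The same supremum, approximated on an equidistant grid."""
    grid = (LAMBDA_C * i / (steps - 1) for i in range(steps))
    return max(beta * lam - lam * lam / 2 for lam in grid)

f_high_temp = rem_free_energy(0.5)   # below sqrt(2 log 2): beta^2 / 2
f_low_temp = rem_free_energy(2.0)    # above: beta * sqrt(2 log 2) - log 2
```

The two regimes meet at $\beta = \sqrt{2\log 2}$, where the constraint becomes active.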

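3. The optimization problem. We first discuss the special case of a GREM.
Therefore, we assume that the sets in $\mathcal{P}$ are nested, that is, $\mathcal{P} = \{J_1, \ldots, J_m\}$,
where $\emptyset \subset J_1 \subset \cdots \subset J_m$. If $A \subset I$, put $l_A := \max\{l : J_l \subset A\}$. Evidently,

$\sum_{J \in \mathcal{P}_A} \frac{\lambda_J^2}{2a_J} \le \log 2\; \gamma(A)$

follows from

$\sum_{i=1}^{l_A} \frac{\lambda_{J_i}^2}{2a_{J_i}} \le \log 2\; \gamma(J_{l_A}).$

Therefore, $\lambda \in \Lambda^+$ is equivalent with

$\sum_{i=1}^{l} \frac{\lambda_{J_i}^2}{2a_{J_i}} \le \log 2\; \gamma(J_l), \quad 1 \le l \le m,$

(and, of course, that all components are nonnegative). Therefore, we have
proved the following:

Lemma 5. Assume that $\mathcal{P}$ is nested as above. Then

$f(\beta) = \sup\left\{ \psi(\lambda, \beta) : \sum_{i=1}^{l} \frac{\lambda_{J_i}^2}{2a_{J_i}} \le \log 2\; \gamma(J_l),\ 1 \le l \le m \right\}.$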

This lemma proves that, in our more general situation, for any chain T, 
the corresponding GREM free energy is an upper bound. 

Corollary 6. For any chain T, we have 

$f(\beta) \le f(T, \beta), \quad \beta \ge 0.$

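Proof. For a given chain $\emptyset = A_0 \subset A_1 \subset A_2 \subset \cdots \subset A_K = I$, we
consider $\Lambda_T^+$, which is obtained by dropping the conditions for the $A$'s which are
not in the chain. Then

$f(\beta) = \sup_{\lambda \in \Lambda^+} \psi(\lambda, \beta) \le \sup_{\lambda \in \Lambda_T^+} \psi(\lambda, \beta).$

We claim that

$\sup_{\lambda \in \Lambda_T^+} \psi(\lambda, \beta) = f(T, \beta),$

which proves the corollary. To see this equation, we write

$\psi(\lambda, \beta) = \sum_{j=1}^{K} \sum_{J \in \mathcal{P}_{A_j} \setminus \mathcal{P}_{A_{j-1}}} \left( \beta\lambda_J - \frac{\lambda_J^2}{2a_J} \right) = \sum_{j=1}^{K} \psi_j(\lambda_j, \beta),$ say,

where $\lambda_j := (\lambda_J)_{J \in \mathcal{P}_{A_j} \setminus \mathcal{P}_{A_{j-1}}}$. Set

$\phi_j(\beta, t) := \sup\left\{ \psi_j(\lambda_j, \beta) : \sum_{J \in \mathcal{P}_{A_j} \setminus \mathcal{P}_{A_{j-1}}} \frac{\lambda_J^2}{2a_J} = t \right\} = \hat{\psi}_j(\sqrt{2\hat{a}_j t}, \beta),$

where $\hat{a}_j := \sum_{J \in \mathcal{P}_{A_j} \setminus \mathcal{P}_{A_{j-1}}} a_J$ and $\hat{\psi}_j(s, \beta) := \beta s - s^2 / 2\hat{a}_j$. We therefore
see that

$\sup_{\lambda \in \Lambda_T^+} \psi(\lambda, \beta) = \sup\left\{ \sum_{j=1}^{K} \left( \beta s_j - \frac{s_j^2}{2\hat{a}_j} \right) : \sum_{j=1}^{l} \frac{s_j^2}{2\hat{a}_j} \le \log 2\; \gamma(A_l),\ 1 \le l \le K \right\} = f(T, \beta),$

the last equality by Lemma 5. □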

In order to finish the proof of Theorem 1, it only remains to construct a
chain $T$ which satisfies $f(\beta) \ge f(T, \beta)$. Then one has also equality by the
above corollary.

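For $B \subset I$, let

$\alpha(B) := \sum_{J \in \mathcal{P}_B} a_J,$

and for $B \subsetneq A$, set

$\rho(B, A) := \sqrt{\frac{2\log 2\, (\gamma(A) - \gamma(B))}{\alpha(A) - \alpha(B)}},$

$\rho(B) := \min_{A : A \supsetneq B} \rho(B, A).$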

We construct a strictly increasing sequence of subsets $\emptyset =: A_0 \subset A_1 \subset
A_2 \subset \cdots \subset A_K = I$ and parameters $\beta_0 := 0 < \beta_1 < \beta_2 < \cdots < \beta_K < \infty$ by
recursion. Assume that $\emptyset \subset A_1 \subset A_2 \subset \cdots \subset A_k$ and $0 < \beta_1 < \beta_2 < \cdots <
\beta_k$ are constructed such that the following conditions are satisfied:

C1(k) $\beta_j = \rho(A_{j-1})$, $j \le k$.

C2(k) For $j \le k$ and any $A \supsetneq A_{j-1}$ which satisfies $\beta_j = \rho(A_{j-1}, A)$, one has
$A \subset A_j$, that is, $A_j$ is maximal with $\beta_j = \rho(A_{j-1}, A_j)$.

For $k = 0$, the conditions are void. If $A_k = I$, then the construction is
finished, and we have $K := k$. Therefore, assume $A_k \neq I$. Then we set $\beta_{k+1} :=
\rho(A_k)$, and prove first that $\beta_{k+1} > \beta_k$. We claim that, for any $A \supset A_k$, $A \neq
A_k$, one has

$2\log 2\, (\gamma(A) - \gamma(A_k)) > \beta_k^2 (\alpha(A) - \alpha(A_k)).$

Indeed, because of $2\log 2\, (\gamma(A_k) - \gamma(A_{k-1})) = \beta_k^2 (\alpha(A_k) - \alpha(A_{k-1}))$,

$2\log 2\, (\gamma(A) - \gamma(A_k)) < \beta_k^2 (\alpha(A) - \alpha(A_k))$

would contradict condition C1(k) and equality would contradict C2(k).

It only remains to construct $A_{k+1}$ which satisfies C2(k+1). Assume there
are two sets $A, A' \supset A_k$, $A, A' \neq A_k$, satisfying

(21)  $\rho(A_k, A) = \rho(A_k, A') = \beta_{k+1}.$

We claim that then also $\rho(A_k, A \cup A') = \beta_{k+1}$. Remark that

$\alpha(A \cup A') \ge \alpha(A) + \alpha(A') - \alpha(A \cap A'),$
$\gamma(A \cup A') = \gamma(A) + \gamma(A') - \gamma(A \cap A'),$

and therefore,

$2\log 2\, (\gamma(A \cup A') - \gamma(A_k)) - \beta_{k+1}^2 (\alpha(A \cup A') - \alpha(A_k))$
$\le 2\log 2\, [\gamma(A) + \gamma(A') - \gamma(A \cap A') - \gamma(A_k)]$
$\quad - \beta_{k+1}^2 [\alpha(A) + \alpha(A') - \alpha(A \cap A') - \alpha(A_k)]$
$= \beta_{k+1}^2 [\alpha(A \cap A') - \alpha(A_k)] - 2\log 2\, [\gamma(A \cap A') - \gamma(A_k)] \le 0,$

the equality by (21), and the last inequality by the definition of $\beta_{k+1}$. From
the definition of $\beta_{k+1}$, we therefore conclude that

$2\log 2\, (\gamma(A \cup A') - \gamma(A_k)) = \beta_{k+1}^2 (\alpha(A \cup A') - \alpha(A_k)).$

We therefore find a unique maximal set $A_{k+1} \supset A_k$ which satisfies
$\rho(A_k, A_{k+1}) = \beta_{k+1}$, and so we have constructed $\beta_{k+1} > \beta_k$, $A_{k+1} \supset A_k$,
$A_{k+1} \neq A_k$ such that C1(k+1) and C2(k+1) are satisfied. The construction
terminates after a finite number of steps.
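
The recursion can be carried out by exhaustive search when $n$ is small. The following sketch is one possible implementation under stated assumptions: $\rho(B, A)$ is set to infinity when $\alpha(A) = \alpha(B)$ (a convention adopted here, not taken from the text), ties between minimizers are detected with a floating-point tolerance, and the maximal set is obtained as the union of all minimizers, as justified by the argument above. All function names and numerical values are hypothetical.

```python
from itertools import combinations
from math import inf, isclose, log, sqrt

def rho(B, A, a, gamma):
    """rho(B, A) for B a proper subset of A; inf if no variance is gained (assumed convention)."""
    d_alpha = sum(w for J, w in a.items() if J <= A and not J <= B)  # alpha(A) - alpha(B)
    d_gamma = sum(gamma[i] for i in A - B)                           # gamma(A) - gamma(B)
    return sqrt(2 * log(2) * d_gamma / d_alpha) if d_alpha > 0 else inf

def strict_supersets(B, I):
    rest = sorted(I - B)
    for r in range(1, len(rest) + 1):
        for extra in combinations(rest, r):
            yield B | frozenset(extra)

def build_chain(a, gamma):
    """Return ([A_0, ..., A_K], [beta_1, ..., beta_K]) by the recursion described above."""
    I = frozenset(gamma)
    chain, betas = [frozenset()], []
    while chain[-1] != I:
        B = chain[-1]
        candidates = [(rho(B, A, a, gamma), A) for A in strict_supersets(B, I)]
        beta = min(r for r, _ in candidates)
        minimizers = [A for r, A in candidates if r == beta or isclose(r, beta, rel_tol=1e-12)]
        chain.append(frozenset().union(*minimizers))  # maximal minimizer = union of minimizers
        betas.append(beta)
    return chain, betas

pairs = [frozenset({1, 2}), frozenset({1, 3}), frozenset({2, 3})]
# Symmetric example (3): the chain collapses to (empty set, I), that is, an REM.
chain_sym, betas_sym = build_chain(dict(zip(pairs, [1 / 3] * 3)), {1: 1 / 3, 2: 1 / 3, 3: 1 / 3})
# Illustrative Case 2 parameters: the chain passes through {1, 2}.
chain_two, betas_two = build_chain(dict(zip(pairs, [0.8, 0.1, 0.1])), {1: 0.2, 2: 0.2, 3: 0.6})
```

The search visits all supersets of the current set, so its cost grows exponentially in $n$; it is meant only as a check on small examples.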

We claim now that, with $T := (\emptyset, A_1, \ldots, A_{K-1}, I)$, we have

(22)  $f(\beta) \ge f(T, \beta).$

Clearly, if $\beta \ge 0$ is small enough, the maximum in (20) is attained in
$\lambda_J^{(1)}(\beta) := a_J \beta$ for all $J$, and therefore,

$f(\beta) = \frac{\beta^2}{2}$

for small $\beta$. This remains valid as long as $(\beta^2/2)\,\alpha(A) \le \gamma(A) \log 2$ for all $A$, that
is, for $\beta \le \beta_1$. For $\beta_k \le \beta \le \beta_{k+1}$, we choose $\lambda^{(k+1)}(\beta)$ defined by

(23)  $\lambda_J^{(k+1)}(\beta) := \begin{cases} a_J \beta_m, & \text{for } J \in \mathcal{P}_{A_m} \setminus \mathcal{P}_{A_{m-1}},\ 1 \le m \le k, \\ a_J \beta, & \text{for } J \notin \mathcal{P}_{A_k}. \end{cases}$

This choice (23) satisfies the side conditions in the range of $\beta$ we are
considering, and hence,

(24)  $\psi(\lambda^{(k+1)}(\beta), \beta) \le f(\beta).$

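We show now that $f(T, \beta) = \psi(\lambda^{(k+1)}(\beta), \beta)$ for $\beta_k \le \beta \le \beta_{k+1}$. An elementary
computation gives

$\psi(\lambda^{(k+1)}(\beta), \beta) = \beta \sum_{i=1}^{k} \beta_i (\alpha(A_i) - \alpha(A_{i-1})) - \gamma(A_k) \log 2 + \frac{\beta^2}{2} (1 - \alpha(A_k))$
$= \beta \sum_{i=1}^{k} \beta_i\, \hat{a}_{A_i} - \gamma(A_k) \log 2 + \frac{\beta^2}{2} \sum_{j=k+1}^{K} \hat{a}_{A_j},$

where $\hat{a}_{A_i}$ is defined by (4). This is exactly the free energy of the
corresponding GREM as given in [1]. [It is, in fact, elementary to check that
$\lambda^{(k+1)}(\beta)$ is the maximizing vector $\lambda$ for the GREM corresponding to the
above chain when $\beta_k \le \beta \le \beta_{k+1}$.] We have therefore proved Theorem 1.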

Remark 7. We have, in fact, proved that

$f(\beta) = \beta \sum_{i=1}^{k} \beta_i [\alpha(A_i) - \alpha(A_{i-1})] - \gamma(A_k) \log 2 + \frac{\beta^2}{2} (1 - \alpha(A_k))$

for $\beta_k \le \beta \le \beta_{k+1}$.
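
The piecewise formula of Remark 7 is straightforward to evaluate once the chain data are known. The sketch below takes the lists $\beta_i$, $\alpha(A_i)$ and $\gamma(A_i)$ as inputs (index 0 standing for the empty set) and selects $k$ as the number of $\beta_i$ not exceeding $\beta$. The function name and the two-level data, which correspond to the illustrative values $a_{12} = 0.8$, $a_{13} = a_{23} = 0.1$, $\gamma = (0.2, 0.2, 0.6)$, are assumptions made for this example.

```python
from bisect import bisect_right
from math import isclose, log, sqrt

def free_energy(beta, betas, alphas, gammas):
    """Formula of Remark 7; betas[i-1] = beta_i, alphas[i] = alpha(A_i), gammas[i] = gamma(A_i)."""
    k = bisect_right(betas, beta)  # number of critical values beta_i <= beta
    frozen = sum(betas[i - 1] * (alphas[i] - alphas[i - 1]) for i in range(1, k + 1))
    return beta * frozen - gammas[k] * log(2) + 0.5 * beta**2 * (1.0 - alphas[k])

# REM: the chain (empty set, I) with beta_1 = sqrt(2 log 2).
rem = ([sqrt(2 * log(2))], [0.0, 1.0], [0.0, 1.0])
# Two-level chain (empty set, {1,2}, I) for the illustrative parameters named above.
two_level = ([sqrt(log(2)), sqrt(6 * log(2))], [0.0, 0.8, 1.0], [0.0, 0.4, 1.0])

f_rem = free_energy(2.0, *rem)
f_two = free_energy(2.0, *two_level)
```

The formula is continuous at each $\beta_k$, and in this example the two-level value lies below the REM value at the same $\beta$, in line with the minimality statement (7).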